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1 Introduction 


Viruses are natural vectors for nucleic acids, and both DNA and RNA plant 
viruses have been engineered to extend or replace conventional vectors for 
delivery of gene-editing reagents. This chapter reviews aspects of viral biology 
essential for engineering vectors, highlight landmark studies using viruses 
to overcome traditional limitations in gene editing, and outline important 
considerations for the use of viral vectors in new systems or for new targets. 

Motivated by fundamental differences in both their infection modes and 
utility as vectors, DNA and RNA viruses are treated separately. DNA viruses 
are assessed as replicon vectors for efficient gene editing through homology- 
directed repair (HDR). The chapter then reviews RNA viruses as mobile vectors 
for gene-editing reagent delivery. The chapter includes key case studies as well 
as future trends in research. 
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2 Viruses as vectors for the delivery of gene-editing reagents 


2 DNA viruses: replicon vectors for efficient gene editing 
through homology-directed repair (HDR) 


Since the first reports over 25 years ago, various plant DNA viruses have been 
sequenced, cloned, and modified to introduce exogenous sequences into plant 
cells. More recently, delivery of sequences encoding components for the major 
classes of site-specific nucleases (SSNs) has demonstrated the utility of DNA 
viral vectors for gene editing in a broad range of host species. Among both 
foundational studies in viral engineering and contemporary studies in gene 
editing, the vast majority describe work with Geminiviridae: single-stranded 
DNA (ssDNA) viruses that constitute the largest known family of DNA viruses 
in plants’. 

Engineered geminiviral clones may be delivered to plant cells by biolistics 
or, more commonly, by agroinfection. While some infectious clones retain 
mobility within the delivered tissue (and potentially to natural insect vectors)’, 
most reports describe clones deconstructed into non-infectious, geminiviral 
replicons (GVRs): self-replicating, cell-autonomous DNA vectors, which 
transiently drive their genetic payload to high-copy number and overexpression 
in host cells, regardless of whether the source vectors integrate into the 
nuclear genome. Investigation of GVRs as overexpression vectors predates the 
widespread availability of gene-editing tools, but their utility for editing was 
quickly realized. In particular, GVRs offer key advantages for precision-editing 
strategies requiring homology-directed repair (HDR) from a donor template 
molecule to incorporate specific modifications into the genome. 

When used to deliver SSNs and donor templates, GVRs dramatically 
increase the frequency of HDR in host cells°. Intuitively, stronger expression of 
SSNs may lead to more efficient double-stranded break induction. GVRs also 
increase the local concentration of donor template molecules available for 
recombination, and pleiotropic effects of viral replication proteins render the 
cell environment more amenable to HDR. In the following sections, we describe 
the basic structure and function of GVRs, consider vector design choices 
influencing their activity within plant cells, and review case studies using GVRs 
to drive efficient HDR in several species of both dicots and monocots. 


3 Geminiviral replicons (GVRs): deconstructed gemini- 
viruses that serve as replicating vectors in plants 


3.1 Geminiviruses 


Geminiviridae are a family of circular, ssDNA viruses, vectored by insects and 
infecting diverse host plants. The name derives from the twin-icosahedral 
shape of the encapsulated virion. The viral ssDNA genome is converted by 
host cell polymerases into a double-stranded DNA (dsDNA) intermediate, 
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which serves as a template for rolling-circle replication and for transcription of 
viral gene products'. With small genomes, encoding just a few open-reading 
frames (ORFs), geminiviruses require host cell machinery to complete their life 
cycle; this reliance underscores complex interactions between viral and host 
factors*, which are important to consider for the function of geminiviral vectors 
in plant cells. Geminiviruses encompass at least nine distinct taxa. Historically 
demarcated by host range or by variations in genome structure', more complex 
phylogenetic relationships are suggested due to viruses identified through 
high-throughput metagenomics*’, or isolates that challenge traditional 
structure-based classifications’. 

Despite the increasing availability of novel sequence isolates, the best- 
studied taxa remain Begomoviruses and Mastreviruses. The attention paid 
to these genera can be attributed to their status as significant pathogens of 
agricultural crops, including cereals, beans, cassava, and tomato. Most studies 
of the molecular biology of geminiviruses, and of their use as replicon vectors, 
have focused on three species of mastreviruses: Maize Streak Virus (MSV) and 
Wheat Dwarf Virus (WDV), infecting monocots (as do most mastreviruses), 
and Bean Yellow Dwarf Virus (BeYDV), from an outgroup infecting dicots. 
Begomoviruses may have monopartite or bipartite genomes; mastreviruses, 
with rare exceptions,® are monopartite. As nearly all GVRs used for gene editing 
are derived from WDV and BeYDV, this chapter focuses on mastreviruses. 


3.2 Mastrevirus genome structure, function, and replication 


The canonical mastrevirus genome is 2.6-2.8kb' (Fig. 1a). Two structural 
sequences termed the short (SIR) and long (LIR) intergenic regions separate 
reading frames in both orientations of the dsDNA intermediate, on the virion- 
sense (v-sense; +) and complementary-sense (c-sense; -) strands. A c-sense 
DNA primer molecule within the SIR, unique to mastreviruses, is thought to 
function as the origin for the first stage of viral replication: conversion of ssDNA 
to dsDNA by synthesis of the complementary strand. The origin for rolling-circle 
replication, conserved across nearly all geminiviruses, is anonanucleotide stem 
loop sequence within the LIR that is nicked by the replication protein, Rep’. The 
LIR and SIR function as a bidirectional promoter and terminator, respectively, 
for the transcripts from both strands. 

On the c-sense strand, alternative splicing of a 3’ intron gives rise to two 
protein-coding transcripts with overlapping reading frames: the unspliced 5’ 
transcript (RepA) and spliced transcript (Rep). Despite sharing much of their 
N-terminal sequence, studies in both monocot- and dicot-infecting mastrevirus 
species implicate these proteins in distinct and conserved functional roles. 
C-terminal deletion and splice-site mutants indicate the spliced transcript and 
minor product, Rep, is both necessary and sufficient for replication in host 
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Figure 1 Structure and delivery of GVRs.(a) The dsDNA genome ofa canonical mastrevirus. 
The double-stranded origin (DSO) for rolling circle replication is a nonanucleotide 
stem loop sequence, TAATATTAC, within the long intergenic region (LIR; ~0.4 kb). LIR- 
promoter activity drives transcription of the v-sense gene products (MP, ~0.3 kb; CP, 
~0.8 kb) and c-sense gene products (Rep, ~1.1 kb; RepA, ~0.8 kb). The short intergenic 
region (SIR, ~0.2 kb) terminates transcription of both sets of transcripts. (b) Geminiviral 
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cells1™12, While not strictly required for replication, the unspliced transcript and 
major product, RepA, likely contributes to regulation of replication through 
interactions with both host and viral factors. First, RepA interacts with the host cell 
retinoblastoma-related protein, a highly conserved regulator of the cell cycle. By 
driving mitotically inactive somatic cells into a pseudo S-phase, mastreviruses 
upregulate the expression of host factors required for viral replication’. Second, 
RepA appears to be an obligate trans-activator for transcription of v-sense gene 
products. By supporting production of the capsid protein to encapsulate newly 
synthesized genomes (thus sequestering them from further replication), RepA 
may contribute to the downregulation of genome replication in live viruses”. 
Additional interactions between Rep and RepA, and their mechanistic roles in 
replication, have been the subject of considerable study, but are beyond the 
scope of this review. For an in-depth treatment of geminiviral replication, see 
Gutierrez?'4"5, 

On the v-sense strand, sequential reading frames encode the viral 
movement protein (MP) and capsid or coat protein (CP). These proteins are 
critical for infection. In addition to directing nuclear import of the virion to 
begin the replication cycle, the virion capsid is required for insect transmission 
and efficient systemic movement. The movement protein, which localizes to the 
cell periphery, mediates interactions between plasmodesmata and the virion 
capsid to facilitate movement among plant host cells'®. In one classic example 
of maize agroinoculation with MSV clones, deletions of the CP and MP, or CP 
alone, abolished systemic movement, insect transmission, and typical disease 
symptoms versus a wild-type control. Cell-autonomous replication of these 
deletion vectors, however, was still detectable’. 


3.3 Deconstructing geminiviruses into replicon vectors 


Experiments with mutants to elucidate wild-type gene function were soon 
followed by reports describing the addition of exogenous sequences, typically 
by replacing the v-sense gene products with reporter or marker gene cassettes. 
By maintaining components essential for viral replication (the LIR, SIR, and 
replication proteins) and eliminating components required for transmission 


Figure 1 (Continued) 

replicons (GVRs) can be delivered to cells in different architectures. When Rep is in its 
native position in cis to the replicon, driven by c-sense promoter activity of the LIR, it 
can be expressed from both the source vector and the high-copy replicon (hundreds 
to thousands of dsDNA intermediates per cell), creating a positive feedback loop for 
replication. When placed in trans to the linear LSL sector of the source vector, continued 
Rep expression depends upon persistent expression (i.e. from stable integration) of the 
source vector. 
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and infection (MP, CP), these clones function as cell-autonomous replicons. 
To date, replicons have been derived from mastreviruses including WDV1617, 
MSV181°, and BeYDV?°2", Although not a focus of this review, it should be noted 
that replicons have also been derived from begomoviruses including Tomato 
Golden Mosaic Virus (TGMV)??3, Bean Dwarf Mosaic Virus (BDMV)**, Tomato 
Leaf Curl Virus (ToLCV)*5?6, and Cabbage Leaf Curl Virus (CaLCuV). In the 
context of plant viral expression vectors, replicons represent a transition from a 
‘full virus’ strategy to a ‘deconstructed virus’ strategy”, which has become the 
archetype for DNA viral expression vectors. 

In moving away from a wild-type viral architecture, synthetic clones open 
up new design possibilities (Fig. 1b). First, natural mastrevirus genomes 
(~2.7 kb) are tightly size-constrained by the need to fit within the capsid, and for 
the virions to traverse plasmodesmata for systemic movement within the host. 
Removal of the MP and CP sequences alone (~1 kb) reduces the size of the 
replicon, but removal of requirements for encapsulation and cell-cell movement 
effectively shifts the upper bound for replicon size to the limit imposed by 
processive replication. This has the effect of greatly increasing vector ‘payload’ 
capacity versus infectious, full-virus clones. While no studies have systematically 
explored an upper limit to replicon size, evidence suggests that the efficiency 
of replication (and thus copy number) decreases with the replicon size”®. In our 
own group, we have transformed functional BeYDV and WDV replicon vectors 
of approximately 15 kb. Finally, even independent of encapsulation, the size 
of plasmodesmata exerts a significant selective pressure on viral genome size, 
such that cell-cell mobility of larger plasmids is compromised”. 

Another design choice in the ‘deconstructed’ strategy is the placement of 
the replication proteins. In the native virus, both ORFs are driven by the c-sense 
promoter of the LIR, and so are expressed in cis to the replicon. However, 
these proteins can also be supplied in trans, by placing the ORFs under a 
heterologous promoter on the source vector or a second vector. This separation 
provides additional degrees of freedom for control over replicon activity, 
by ‘tethering’ expression of the replication protein to the source vector. For 
example, replication may be constrained by a chemically inducible promoter,” 
or by obligate co-delivery of two vectors.° Finally, as noted earlier, only Rep is 
strictly required for replication. While some synthetic clones purposed solely 
for overexpression eschew RepA,” the pleiotropic benefits for replication 
and HDR in somatic cells conferred by RepA have motivated its inclusion in all 
reports of gene-editing vectors to date. 


3.4 Delivery and expression of replicons in plant cells 


Replication-competent geminiviral clones can be delivered into plant cells as 
plasmids or as linear dsDNA fragments by traditional means of transformation: 
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PEG-treated protoplasts?’, biolistics?®, or agroinfection. In early reports 
describing agroinfection of begomovirus*° and mastrevirus?!32 clones, source 
vectors included full tandem repeats of the viral genome. Such repeats 
might enable escape and replication of a unit-length viral genome from the 
source vector by either of two, non-exclusive mechanisms: homologous 
recombination between repeats or by a replicative intermediate copied from 
the dsDNA source vector. It was later shown that duplication of just the v-sense 
ORI within the LIR is sufficient, implicating a rolling-circle replication mediated 
‘replicational release’ mechanism® for viral escape. Subsequent reports largely 
follow a LIR-SIR-LIR (LSL) architecture for delivery of viral genomes from a dsDNA 
source vector. A benefit of this architecture and delivery mechanism is that the 
population of replicating viral genomes is both homogenous and predictable, 
consisting of the sequence between the origins within the duplicated LIRs. By 
contrast, circular viral replicons produced by homologous recombination, while 
observed, are far more likely to be heterogenous populations. Importantly, 
recombination of T-DNA source vectors may be stimulated by Rep-mediated 
viral replication, whether due to the availability of replicated sequences, or to 
potential crosstalk between replication and recombination machinery**. 

Upon delivery to plant cells, replicon copy number and cargo expression 
typically peak within a few days to a week after infection®*®. While this transient 
activity does not strictly require stable integration of the source vector 
(especially for autonomous replicons: see Fig. 1b), integration can lead to 
persistent replicon activity over time through mitotic and meiotic cell lineages. 
Studies have detected circular and high-copy replicons in the somatic tissues 
of T, rice and tomato plants regenerated through tissue culture?®3¢, and there 
is evidence for persistence of a BeYDV replicon among the transgenic progeny 
of Arabidopsis lines as far as the T, generation”. 


4 Replicon vectors for efficient homology-directed repair 


Precise gene targeting (GT), encompassing site-specific sequence alterations 
and targeted insertions, is typically achieved by HDR with a donor template 
molecule. Since induction of double-stranded breaks (DSBs) greatly increases 
the frequency of HDR in plant cells?, SSNs including zinc-finger nucleases 
(ZFNs)??, TAL-effector nucleases (TALENs)*°, and CRISPR/Cas nucleases have 
been used to stimulate HDR and recover stable GT events. However, the 
most common outcome from SSN-mediated DSB induction is mutagenesis, 
whereas GT events occur typically an order of magnitude less frequently. This 
is attributed to the predominance of alternative DNA repair pathways, namely 
microhomology-mediated end joining and non-homologous end joining, and 
to the additional requirement for HDR that DSB induction be coordinated with 
availability of the donor molecule. 
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GVRs contribute to higher frequencies of HDR in several ways. Perhaps the 
most significant is by greatly increasing availability of the donor molecule. With 
traditional T-DNA vectors, at most several copies of the donor are delivered 
to the cell. Biolistics can deliver a greater amount of donor, but the likelihood 
of multi-copy, random integration makes this approach less appealing for 
applications in breeding where segregation of integrated reagents is likely 
required. GVRs can be delivered to cells on T-DNA vectors, but increase the 
copy number of the donor molecule to hundreds or thousands within a single 
cell nucleus. Additionally, the activity of the replication proteins contributes 
to a more favorable cell environment for HDR. The prevalence of DNA repair 
pathways is highly influenced by the cell state, and HDR is particularly low in 
the mitotically inactive somatic cells which are primary targets for regeneration 
in many species. Even in the absence of DSB induction or a synthetic repair 
template, geminiviral infection was observed to enhance somatic HDR in a 
transgenic reporter line of Arabidopsis*". 

Besides the known influence of RepA on cell state, the replication proteins 
may also have evolved a role to promote recombination, due to an evolutionary 
selective pressure to generate new viral variants. For example, Rep and/or RepA 
may mediate recombination by their affinity for host factors. This could take 
the form of crosstalk between replication and recombination machinery™, or 
even of more direct recruitment: one report from the begomovirus, Mungbean 
yellow mosaic virus (MYMIV), identified a protein-protein interaction between 
the MYMIV Rep and Arabidopsis RAD51, a highly conserved protein mediating 
homologous recombination in eukaryotes. While no such direct interactions 
have been described for mastreviruses, it is conceivable that analogous links 
between viral replication proteins and host recombination factors contribute to 
increasing the recombination potential of sequences carried on GVRs. 

Regardless of mechanism, GVR vectors typically increase HDR frequencies 
by an order of magnitude. In the following case studies, we will highlight 
notable results and conclusions from reports establishing GVR-based editing 
strategies in dicots and monocots. 


5 Case studies: GVR-based editing strategies in dicots and 
monocots 


5.1 Baltes et al. (2014): DNA replicons for plant genome 
engineering? 


Baltes et al. (2014) published the first report demonstrating the utility of 
GVRs for HDR. They used a previously established assay, leveraging a broken 
nptll::gus fusion gene present in a transgenic tobacco reporter line}. A ZFN 
pair targeting the middle of the broken fusion gene stimulates DSB induction, 
and templated repair completes the 5'nptll and 3’ gus sequences, restoring the 
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function of both markers. Positive GUS staining in agroinfiltrated leaves from the 
reporter line thus indicates HDR, and the density of positive cells within sectors 
of an infiltrated leaf provides a proxy for gene-targeting frequency. Using this 
assay, Baltes and colleagues found that the GVR system outperformed the 
standard T-DNA by as much as two orders of magnitude. They also tested what 
specific benefit might be attributed to enhanced SSN expression (putatively, via 
greater DSB induction), or to replication of the repair template, and found that 
gene-targeting frequencies were boosted only when the repair template was 
replicated. Removing the ZFNs entirely attenuated GUS staining, but placement 
of the ZFN pair inside or outside the LSL segment of the same T-DNA made 
no significant difference. Taken together, these results suggest DSB induction 
is essential but not rate-limiting to higher gene-targeting frequencies versus 
repair template copy number. Next, possible contributions from pleiotropic 
activity of the replication proteins were assessed. The gene-targeting frequency 
in a non-replicating T-DNA control was significantly increased by co-delivery of 
a plasmid expressing Rep and RepA, consistent with a role for these proteins 
in enhancing HDR independent from GVR replication. Finally, using kanamycin 
selection to identify cells in which the nptll marker was repaired through HDR, 
Baltes and colleagues were able to regenerate tobacco plants with sequence- 
confirmed HDR events, demonstrating the utility of the GVR-based approach to 
generate heritable events. 


5.2 Cermak et al.”°: High-frequency, precision modification of 
the tomato genome 


Cermak etal. used a similar strategy to recover edited tomato plants, targeting 
knock-in of a selectable marker. Unlike Baltes et al., however, they targeted 
an endogenous locus: ANT1, a transcription factor controlling anthocyanin 
accumulation. The purple pigmentation arising from ANT1 overexpression 
provides a phenotype that can be scored non-destructively, unlike GUS, and 
is thus useful for selection in vivo. To create an HDR reporter, Cermak and 
colleagues targeted TALENs or CRISPR/Cas9 reagents to the ANT1 promoter. 
Their repair template contained, in sequence, a nptll cassette, and an additional 
strong promoter to drive overexpression of the native ANT7 gene. Vectors were 
delivered to tomato cotyledons by agroinfection. The fraction of cotyledons 
showing purple spots, normalized for transformation efficiency, provided 
an estimate for gene-targeting efficiency. The best treatment among their 
experiments was a single-vector GVR with Cas? and sgRNA, which achieved 
a normalized gene-targeting frequency of 11.66%. This was an order of 
magnitude better than the conventional T-DNA treatment at 1.27%. Importantly, 
these experiments were carried out using kanamycin selection in the growth 
media; when cultured on non-selective media, no purple spots were observed, 
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even with the best GVR treatment. While kanamycin was used in this initial 
phase of tissue culture to recover growing purple calli, visual selection alone 
proved sufficient to then regenerate plants with gene-targeting events; over 
two dozen plants were regenerated from several purple calli in the absence of 
kanamycin. Subsequent molecular analyses revealed events including perfect 
repair at both junctions, and one-sided targeting events that still gave rise 
to the phenotype. Most surprisingly, both PCR and Southern assays failed to 
detect the presence of the T-DNA or extrachromosomal replicon among any 
of the regenerated plants. This suggested that transient expression of the 
gene-editing reagents was sufficient, and nuclear integration was not required, 
to recover events. This is a notable result for some breeding systems where 
transgenesis may be undesirable, either for regulatory concerns, or due to 
difficulty in segregating integrated reagents from the desired edit (e.g. in 
clonally propagated species). Finally, Cermak and colleagues showed their 
gene-targeting events were heritable. T, progeny scored visually and genotyped 
by PCR approximated Mendelian expectation. A fitness penalty incurred from 
anthocyanin overproduction might explain the slight segregation bias against 
the edited allele, and might also have contributed to their failure to recover 
purple growths from transformed cotyledons in the absence of kanamycin 
selection. 


5.3 Dahan-Meier et al.*°: Efficient in planta gene targeting in 
tomato using geminiviral replicons and the CRISPR/Cas9 
system 


Both of the previous studies used HDR to activate chemical and phenotypic 
selectable markers, a requirement which limits utility in many breeding 
applications. Dahan-Meier et al.°° developed a new strategy, demonstrating 
efficient and heritable gene targeting for a ‘neutral’ trait in tomato. Instead 
of selecting for their targeted knock-in, they created transgenic plants that 
carried a T-DNA with the editing reagents, combining an in planta approach 
to gene targeting} with a GVR vector. Their target was a mutant allele of 
the CRTISO locus, which encodes an enzyme in the carotenoid biosynthesis 
pathway. Whereas wild-type CRTISO plants have red fruits, their target line 
featured a 281 bp deletion in CRTISO. This recessive, loss-of-function mutation 
gives rise to orange fruits in homozygotes, which can be restored to red 
fruit through HDR-mediated correction of a single allele. While providing a 
scorable phenotype, the fruit color trait is only visible in mature plants; this 
contrasts to the anthocyanin marker used in Cermak et al., which permitted 
enrichment of events during the primary tissue culture step. The experimental 
T-DNA contained a nptl! marker for transgenic selection, Cas? and a sgRNA 
targeting the mutant allele, and a BeYDV GVR (with Rep in trans) to amplify 
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the donor sequence. Among 36 regenerated T, plants, nine had entirely red 
fruit. Molecular evidence from junction PCR and Southern blots confirmed 
HDR in sampled leaf tissue. Of these nine T,, eight fertile plants showed 
Mendelian segregation for the fruit color phenotype among their T, progeny, 
demonstrating heritability of these gene-targeting events. In contrast to the 
GVR treatment, only one, chimeric plant, with both red and orange fruits, was 
recovered among the 12 T, plants regenerated with a control construct that 
lacked the Rep cassette (thus disabling GVR replication). T, progeny from 
these red fruits segregated for the fruit color trait, indicative of a heterozygous 
knock-in, whereas no red fruits came from the orange T, fruits from the same 
chimeric plant. 

The above results underscore a crucial point: in order to recover heritable 
gene-targeting events, the edit must occur in a germline cell, or germinal 
cell linage. The targeted ANT7 tomato lines in the Cermak study were edited 
soon after transformation, as indicated by the purple phenotype already 
visible at the callus stage. In the Dahan-Meier study, the low rate of chimerism 
evidenced by uniform fruit color and Mendelian segregation of T, progeny in 
the replicon treatment similarly indicated that most of these events occurred 
early in development. By contrast, Shan et al.?” failed to recover heritable 
gene-targeting events in Arabidopsis using a BeYDV GVR, despite a high 
frequency of somatic editing among transformed plants. In the previous 
studies, the transgenic tobacco or tomato plants were regenerated through 
tissue culture of somatic explants, involving organogenesis and establishment 
of a new germline. In Arabidopsis, the floral dip protocols used directly 
transformed existing germline cells. It is possible that established germline 
cells restrict viral activity more stringently than somatic cells. Alternatively, it 
may be that additional unknown barriers prevent efficient GVR-mediated HDR 
in Arabidopsis; another report failed to observe even efficient somatic gene 
targeting, again using a BeYDV replicon in Arabidopsis**. However, these 
results stand in contrast to the majority of published studies in other dicot 
species. 


5.4 Gil-Humanes et al.” and Wang et al.?°: Gene targeting in 
cereals with WDV 


Soon after the first gene-editing reports with BeYDV, two studies described 
knock-ins in cereals using WDV. Gil-Humanes et al.’ targeted fluorescent 
reporters to the wheat genome. Cas? and a sgRNA targeted the endogenous 
ubiquitin locus, while a repair template included the coding sequence for GFP 
behind a T2A self-cleaving peptide linker. HDR of the break site placed the GFP 
gene in-frame with the native ubiquitin gene, leading to reporter expression 
driven by the native promoter. In protoplast transfections, green fluorescing 
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cells were identified only in treatments placing all of the editing reagents on 
the WDV replicon; in the best case, with a strong promoter for Cas9, 3.8% 
of cells underwent the targeting event. Additional protoplast experiments 
demonstrated multiplexed gene targeting using a BFP tag at a second 
locus, and molecularly confirmed targeting at all three of the wheat ubiquitin 
homeoalleles. Despite these promising results, Gil-Humanes and colleagues 
were unable to recover plants with the gene-targeting event. They did observe 
reporter positives in biolistic transfection of embryonic scutella, a typical system 
for tissue culture of wheat. However, these cells could not be regenerated even 
under chemical selection for the biolistic vector or for an alternative knock-in of 
the bar selectable marker gene (Gil Humanes et al., unpublished). 

Wang et al. described gene targeting in rice using selectable and 
screenable markers. Similar to Gil-Humanes et al., they used Cas9 and a 
sgRNA to knock in markers behind the promoter of either of two highly 
expressed native loci. In this case, a nptll::GFP fusion gene provided means 
to select for HDR events in agroinfected-rice calli. Geneticin-resistant 
plantlets were recovered through tissue culture, and screened by PCR for 
correct repair at the repair template junctions. Two different rice lines were 
transformed with test constructs: a transgenic line constitutively expressing 
Cas9, and a wild-type line, for which Cas9 was included on the T-DNA vector 
but outside the replicon. Double-junction PCR positives were recovered 
among the T, plants of both lines transformed with WDV GVRs, at frequencies 
ranging from 4.7% (in the wild type line) to 19.4% (in the transgenic line). 
Positives were also recovered from the non-GVR control treatment in the 
Cas9 line at 6.8%. No non-GVR control treatment was reported for the wild- 
type line. T, progeny or other analyses demonstrating germinal transmission 
were also not reported. 

Together, these results show that GVRs can enhance gene targeting in 
monocots, but do not yet present a convincing case that WDV vectors are 
simple to use for the recovery of targeted plants. Our own group and several 
collaborators have been unable to regenerate rice plants with knock-ins using 
WDY, even while targeting several different loci and applying selection for the 
T-DNA. We suspect Wang et al. succeeded in recovering transgenic plantlets 
largely by virtue of having imposed selection for the HDR event. The absence of 
additional reports reporting success suggests continued research is required 
to generalize the GVR strategy to cereals and other monocots. 


6 GVR-based editing strategies: summary and future trends 


In case studies described above and several additional reports, GVRs have 
been used to mediate HDR across species including tobacco, tomato, potato, 
cassava, wheat, and rice?67835-37.45.46 High-frequency, germline edits have been 
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achieved in dicots with BeYDV, whereas WDV has shown promise for HDR in rice 
and in wheat cell culture. For these and other plant systems, additional research 
will be required to optimize GVR-based editing strategies. For example, the 
current BeYDV-and WDV-derived replicons may not function in all target 
crops: GVRs from other viruses have failed to replicate in some species, likely 
indicative of virus-specific host range requirements”. In monocots, more work 
is needed to expand the utility of GVRs within extant tissue culture regimes to 
enable heritable edits to be recovered. These challenges may be addressed 
by leveraging increasing viral sequence resources made available through 
metagenomics’, or by more advanced engineering strategies for the delivery 
and control of GVRs. 


7 RNA viruses: mobile vectors broadly applicable for 
gene-editing reagent delivery 


RNA viral vectors are rapidly emerging as effective means for delivering 
genome engineering reagents. RNA viruses have small genomes with only 
a few essential genes required for replication, movement, and infection*”*°. 
These viruses have been used as vectors for protein production, viral-induced 
gene silencing, and more recently, gene editing’, The earliest reports of their 
use for gene editing involved using them to express a ZFN to edit an integrated 
reporter**. Vectors then quickly adopted the CRISPR/Cas9 technology by 
expressing sgRNAs for editing in a transgenic plant that expresses Cas9°. 
This was first demonstrated in Nicotiana benthamiana and then expanded into 
several other species®*°*, More recently, the technology was further improved 
to enable high frequencies of heritable editing in N. benthamiana”. Future work 
is required to expand heritable editing to other species and to develop vectors 
that no longer require transgenic plants for gene editing. Further advancement 
of the technology could allow transgene and tissue-culture-free gene editing 
of a variety of plant species. This would be a major advance for plant gene 
editing. 

RNA viruses, or viruses that do not have a DNA intermediate in their 
replication cycle, have many advantageous properties for use as vectors, 
including compact genomes, systemic plant movement, non-host integrating 
replication cycle, and well-studied interactions with host defense systems. 
Regardless of whether RNA viral genomes are mono or multipartite, they 
are relatively simple, containing only a few coding regions essential to their 
replication, movement, and infection (Fig. 2). Replication of the viral genome 
and expression of sub-genomic RNAs, which are translated into proteins, is 
dependent on the RNA-dependent RNA polymerase (RdRP)*°°%*. Cell-to-cell 
and systemic movement of RNA viruses relies on the movement protein (MP). 
RNA-viral MPs guide the viral genome to the plasmodesmata“ and often 
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TRV1 = RDRP +> VSR 


TRV2 CP Cargo 


Figure 2 Tobacco Rattle Virus (TRV) vector. TRV is a bipartite positive-sense single- 
stranded RNA virus. TRV1 contains the RNA-dependent RNA polymerase (RdRP) which 
replicates both TRV1 and TRV2 and expresses subgenomic RNAs of other coding regions. 
TRV1 also encodes the movement protein (MP), responsible for enabling cell-to-cell 
movement, and the viral suppressor of RNA silencing (VSR), responsible for suppression 
of the host immune response. TRV2 contains the coat protein (CP) and can be modified 
for expression of heterologous sequences. The native TRV2 contains coding sequences 
to enable transmission between plants, which are removed in the vector. Heterologous 
sequences are expressed from a subgenomic promoter, either duplicated from the same 
virus or from a closely related virus. 


increase the plasmodesmata size exclusion limit*®, permitting the full viral 
genome to pass from cell-to-cell. Systemic infection occurs by transport of the 
virus through the phloem, with unloading of the virus at distal sites, leading 
to systemic infection’. Some viruses require encapsidation, or coating of the 
viral genome with a viral encoded coat protein (CP), for systemic infection’. 
Encapsidation is also an important factor in vector transmission between plants 
by insects. Different plant RNA viruses may encode other factors involved in 
infection, transmission between plants, or other host interactions such as 
combating host defense. 

The well-studied genomes of several RNA viruses make them ideal for 
engineering and repurposing for use as vectors. Indeed, shortly after some 
of the first infectious cDNA clones of RNA viruses were created, efforts were 
undertaken to use them as vectors for expression of heterologous proteins®?°. 
Early approaches involved expression of reporter genes®', but vectors were 
quickly adapted for the expression of proteins of value for pharmaceutical or 
agricultural applications®’. 


7.1 Modifications of RNA viruses into RNA viral vectors 


One of the first steps in creating a plant RNA viral vector is to generate a cDNA 
of the viral transcript and adapt it for compatibility with various molecular 
cloning techniques. The cDNA vectors are often placed behind standard plant 
promoters. When transferred to plant cells, a transcript is produced that serves 
as the RNA virus genome; the transcript is replicated and enables systemic 
infection throughout the plant. A second modification commonly used with 
viral vectors is to incorporate them into a backbone that enables replication 
in Agrobacterium and T-DNA transfer into plant cells. Finally, the viral genome 
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itself is modified to express heterologous sequences (Fig. 2). This can involve 
viral gene replacement or gene insertion behind a subgenomic promoter®. For 
further reading on the construction of plant viral vectors we recommend Pasin 
et al.ć4, 


7.2 RNA viral vectors for virus-induced gene silencing 


Perhaps one of the best-known applications of RNA viral vectors is for viral- 
induced gene silencing (VIGS). VIGS functions by taking advantage of the 
native plant post-transcriptional RNA silencing (PTGS) mechanism. By placing 
a small fragment of a gene of interest into an RNA viral vector, host PTGS, in 
an attempt to silence the virus, will also silence the native gene®. This has 
proven a valuable approach for reverse genetics to better understand gene 
function, and VIGS vectors have been developed for a large number of plant 
species. A disadvantage of viral vectors for heterologous protein production 
or VIGS is that the effects of the assay are transient. Approaches to use viral 
vectors to create permanent genetic changes are outlined in the following 
sections. 


8 Case studies: use viral vectors to create permanent 
genetic changes 


8.1 Marton et al. 2010: Nontransgenic genome modification 
in plant cells®4 


With the advent of SSNs, RNA viral vectors were soon tested to determine if they 
could be used to express SSNs to create genetic modifications in infected plant 
tissues. One of the earliest examples of this is the use of Tobacco Rattle Virus 
(TRV) to express ZFNs for plant gene editing. Traditionally, genome-editing 
reagents are delivered to plant cells through stable transformation, wherein the 
SSN is introduced into plant cells by Agrobacterium or biolistics. Transgenic cells 
are then regenerated into whole plants through tissue culture, and progeny are 
screened for the presence of the targeted genome modification. Marton and 
colleagues described a method for non-transgenic gene editing of plant cells 
using a TRV expression system. TRV is a bipartite virus with two RNAs: TRV1 
and TRV2. TRV2 was modified to contain the Pea Early Browning Virus (PEBV) 
coat protein subgenomic promoter upstream of a multiple cloning site. This 
expression system was tested by first adding fluorescent reporter genes into the 
multiple cloning site. When plants were infected with these viruses, expression 
of the report(i.e., fluorescence) was observed in developing tissues throughout 
the plant. This indicated that TRV2 could be used to carry and express cargo in 
infected and growing portions of the plant. Next a ZFN monomer was added 
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to TRV2 that targets a defective gus reporter integrated into the genomes of N. 
benthamiana and petunia. Mutagenesis by the ZFN should create a functional 
GUS gene. GUS expression was observed in infected tissue in both species, 
indicating ZFN-mediated editing of the reporter. Finally, N. benthamiana and 
petunia tissue infected with TRV2 expressing the ZFN was placed onto tissue- 
culture medium to regenerate plants from the infected tissue. GUS activity was 
observed in the regenerated tissue, and the edited target was stably inherited. 
The seedlings were also confirmed to be free of virus, verifying the method as 
a non-integrating delivery means of generating stably maintained, site-specific 
gene edits. 

A longstanding goal for the use of RNA viral vectors for gene editing is to 
infect and edit endogenous genes in wild-type plants. This initial report of gene 
editing using RNA viruses was significant because it demonstrated editing 
in a plant by an RNA viral vector that expresses all the reagents required for 
gene editing. This was only made possible due to the relatively small size of 
a ZFN monomer. ZFN dimers are typically required to edit genes; however, 
the gus reporter had been engineered to carry two recognition sites for the 
ZFN monomer, thereby enabling gene editing by the monomer. Editing of 
endogenous genes is prohibited by the requirement of ZFN dimers for DNA 
cleavage and low-editing efficiencies®’. Viral vectors for genome editing need 
improvements to effectively target endogenous loci. 

Shortly after the previous case study was published, CRISPR/Cas9 
technology was developed® which promised significantly improved ease of 
gene editing. One disadvantage of the CRISPR/Cas9 system is that while the 
targeting component of the site-specific nuclease is relatively small, namely 
the sgRNA, Cas9 is a large protein and well beyond the carrying capacity of 
RNA viral vectors developed to date. RNA viral vectors have a limited carrying 
capacity due to the error-prone RdRP, which often truncates non-essential 
viral components; shorter, wild-type viruses quickly outcompete modified 
vectors. Further, vectors modified to express large cargo are often unable to be 
encapsidated®’. Ali and colleagues proposed a solution to this problem: they 
created a transgenic plant that expresses the Cas9 protein, and used the viral 
vector to express the sgRNA®'. Their results are highlighted in the following 
case study. 


8.2 Ali et al. 2015: Efficient virus-mediated genome editing in 
plants using the CRISPR/Cas9 system"! 


Recognizing delivery of genome engineering reagents to plant cells is a significant 
limitation to achieve gene editing, Ali and colleagues sought to provide a means 
to efficiently and quickly deliver CRISPR/Cas9 reagents using RNA viruses. They 
created transgenic N. benthamiana plants that express Cas9, and used TRV to 
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deliver sgRNAs that target unique sequences in the genome. TRV2 was modified 
in a similar manner to Marton et al. 2010 by expressing the sgRNA from a PeBV- 
subgenomic promoter. To test the effectiveness of this vector in gene editing, 
a sgRNA-targeting N. benthamiana phytoene desaturase (PDS) was cloned into 
the TRV2 vector and subsequently agroinfected along with a TRV1 vector into the 
transgenic N. benthamiana plants that express Cas9. Gene editing was observed 
in the inoculated and systemically infected leaves. The method was further 
validated by targeting a second locus, PCNA, and editing was again observed 
in the inoculated and systemically infected leaves. Next, multiple TRV2 vectors, 
each expressing a unique sgRNA, were simultaneously tested to determine if they 
could achieve multiplexed editing. TRV2 vectors expressing sgRNAs targeting 
PDS or PCNA were co-infected into N. benthamiana, and editing was observed at 
both loci, although at a lower frequency than when inoculated separately. 

The applicability of RNA viral vectors for gene editing would be greatly 
improved if the targeted mutation occurred in the plant germline. This would 
allow seeds containing the desired modification to be collected from the 
infected plant, thereby eliminating the need for tissue-culture in many gene- 
editing experiments and increasing ease and throughput. Ali and colleagues 
reported evidence of germinal transmission of a targeted mutation, although 
at a very low frequency. This indicated that future work was needed to increase 
germline edits to a frequency that is practical. Also, the approach needed to be 
expanded to species other than the N. benthamiana model. 


8.3 Expanding RNA viral vector gene editing to other species 


After Ali et al. demonstrated the ability of viral vectors to perform efficient gene 
editing in somatic cells of Cas9 expressing N. benthamiana, the same group 
and several others tested this approach for gene editing in other species and 
with other viral vectors. PEBV and Beet Necrotic Yellow Vein Virus (BNYVV) were 
demonstrated to perform gene editing in Cas9-expressing somatic cells of N. 
benthamiana“, and Tobacco Mosaic Virus (TMV) was used for multiplexed gene 
editing’®. TRV was also used to demonstrate gene editing in Cas9-expressing 
Arabidopsis®, highlighting the wide host range and utility of this virus. In order 
to expand to monocot species, Foxtail Mosaic Virus (FoMV) was modified to 
contain a duplicated coat protein subgenomic promoter for expression of 
sgRNAs. This vector was able to perform gene editing in Cas9 expressing N. 
benthamiana, Setaria viridis, and maize*. Finally, a Barley Yellow Striate Mosaic 
Virus (BYSMV) vector was modified to express proteins in wheat, barley, foxtail 
millet, and maize. BYSMV is a negative-sense RNA virus, whereas all other virus 
vectors discussed are positive-sense RNA viruses. Negative-sense RNA viruses 
could provide a larger carrying capacity than the positive-sense RNA viruses, 
as the authors of this study demonstrated expression of both the sgRNA and 
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Systemic 
viral spread 


Figure 3 A method for infecting plants with RNA viral vectors and generating heritable 
genome edits. (a) Transgenic plants expressing Cas9 are infected with an RNA viral vector 
expressing a mobile sgRNA to increase genome-editing frequencies. (b) This vector 
systemically infects the plant, interacting with endogenously expressed Cas9 for high- 
frequency genome editing (future methods may express all the genome-editing reagents 
from the virus, allowing editing of wild-type plants). Dashed lines and shading indicate 
genome editing. (c) Seeds are collected from infected plants and progeny are screened 
for editing of the targeted site. (d) High percentages of the progeny contain the desired 
edit; shaded seedlings indicate the presence of targeted mutations. 
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Cas9 from the virus and editing at the infiltrated site in N. benthamiana; they 
did not report any systemic editing’'. None of the previously mentioned studies 
demonstrated heritable editing without using tissue culture, the exception 
being Ali et al. 2015, in which, as mentioned above, heritable editing was 
observed at a very low frequency: only 1 in 438 seedlings tested. In order 
for viral vectors to be widely adopted as vectors for site-specific editing, the 
heritable editing frequency needs to be improved. Ellison and colleagues show 
improved heritable editing frequency in N. benthamiana by adding motifs that 
promote mobility of the sgRNAs expressed from TRV vectors (Fig. 3). Their 
results are described below. 


8.4 Ellison et al. 2020: Multiplexed heritable gene editing 
using RNA viruses and mobile single guide RNAs?” 


Ellison and colleagues sought to improve the frequencies of heritable gene 
editing with RNA viral vectors, recognizing that editing through infection 
could eliminate or reduce the need for tissue-culture - a major bottleneck in 
applying biotechnology to plant science research. The authors hypothesized 
that by enabling viral or sgRNA sequences better access to the germline, gene 
edits would be transmitted to the next generation at higher frequencies. To 
enable mobility, a Flowering Locus T (FT) motif was added to the 3’ end of 
sgRNA sequences. The FT motif was previously shown to promote mobility 
of RNAs into the meristem’?. sgRNA sequences targeting N. benthamiana 
PDS and augmented with this mobility sequence were cloned into the same 
TRV2 vector described in Ali et al. 2015. These vectors were agroinfected 
into Cas9-expressing N. benthamiana along with TRV1. Shortly after infection, 
phenotypes suggesting PDS mutagenesis emerged, and this was substantiated 
by determining that somatic editing frequencies were greater than 80%. 
Germinal transmission of these edits was observed by phenotyping (Fig. 4) 
and genotyping seedlings: up to 65% contained a mutation in at least one PDS 
allele. This high frequency of mutagenesis was demonstrated at a second locus, 
AGAMOUS (AG). In this case, up to 100% of seedlings contained a mutation 
in at least one AG allele. The mobility motif was not limited to FT: tRNA-like 
sequences have previously been shown to promote cell-to-cell mobility”, and 
such tRNA-like sequences were also demonstrated to enhance frequencies of 
heritable mutagenesis when fused to sgRNAs. In addition, heritable multiplexed 
editing of native N. benthamiana genes was achieved. Multiple sgRNAs could 
be expressed from a single TRV2 vector to achieve heritable mutations in all 
targets. This report establishes the ability of RNA viral vectors to perform high 
efficiency, heritable gene editing. Further work needs to be performed to 
expand this method to other species and to develop RNA vectors that are not 
reliant on a transgenic plant expressing Cas9. 
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Figure 4 Phytoene Desaturase (PDS) phenotypes in Cas9-expressing N. benthamiana 
plants infected with Tobacco Rattle Virus (TRV) vectors. TRV expresses mobile sgRNAs 
targeting the PDS coding sequence, which results in a bleached phenotype due to 
chlorophyll photo-oxidation. (a) Infected plants show an increase in the PDS-knockout 
phenotype as the plant matures. In some cases, the entire upper portion of the plant is 
bleached, indicating a PDS mutation occurring in meristematic cells. (b) Systemic plant 
leaves contain high frequencies of editing from RNA viral vectors expressing mobile 
sgRNAs. The leaf shown here contains PDS mutations in nearly every cell, causing an 
almost completely bleached phenotype. (c) Seeds are collected from plants infected with 
the TRV vector expressing mobile PDS sgRNAs. Several of the resulting seedlings are 
entirely bleached, indicating germline mutagenesis of the PDS locus. 


9 RNA viral vectors: conclusion and future trends 


Allthree of the case studies described in this chapter illustrate the growing interest 
inthe use of RNA viral vectors for plant gene editing. While there are considerable 
milestones yet to be achieved, these vectors provide considerable promise 
for tissue-culture-free gene editing. Future work to improve the technology 
should focus on expanding the species range of heritable gene editing. This will 
require optimization of vectors for each species along with the establishment 
of transgenic plant lines that express Cas9 at high levels and in the appropriate 
tissues. In order to eliminate the need for transgenesis, efforts should be made 
to test whether smaller Cas or other nucleases can be systemically transported 
by RNA viral vectors. Alternatively, it might be possible to increase the carrying 
capacity of virus vectors. Recently, Ma and colleagues developed vectors based 
on Sonchus Yellow Net Nucleorhabdovirus (SYNV), a negative-sense RNA virus. 


Published by Burleigh Dodds Science Publishing Limited, 2021. 


Viruses as vectors for the delivery of gene-editing reagents 21 


Remarkably, these vectors could deliver both Cas9 and sgRNAs’*. Gene-edited 
plants were recovered by regenerating infected plant tissue without selection, 
and over 90% of regenerated plants carried mutations. If SYNV could access 
the germline, perhaps through addition of RNA motifs that promote cell-to-cell 
mobility as described by Ellison et al.5”, it may be possible to recover heritable 
mutations solely through infection and without tissue culture and transgenesis. 

RNA viral vector utility can also be expanded by using different Cas 
variants with different PAM requirements to increase the targeting range or to 
create unique editing outcomes. It should also be possible to move beyond 
non-homologous end joining or microhomology-mediated repair of the target 
locus to achieve more precise modifications through base editing’*”’ or prime 
editing’®. Background work on RNA genome structure and function, followed 
by the development of vectors for protein expression, VIGS, and genome 
editing, will facilitate improvements and increase the scope, specificity, and 
broad applicability of RNA viral vectors. 


10 Where to look for further information 


Many viral vectors, including all of our group's published vectors, are available 
at Addgene, a non-profit plasmid repository. BeYDV and WDV replicons are 
among the available reagents, and they are compatible with the plant genome 
engineering toolkit described in Cermak et al.’’. Lastly, viral sequences can 
sometimes be challenging to assemble and clone in bacterial hosts, due to 
structural features or toxicity of viral gene products; for a review of strategies 
for viral clone assembly, see Pasin et al.%. 
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